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FACTORIZATION OF A MATRIX DIFFERENTIAL OPERATOR 
USING FUNCTIONS IN ITS KERNEL 


Abstract. Just as knowing some roots of a polynomial allows one to factor it, 
a well-known result provides a factorization of any scalar differential operator 
given a set of linearly independent functions in its kernel. This note provides 
a straight-forward generalization to the case of matrix coefficient differential 
operators. 


1. Motivation 

An ordinary differential merator (ODO) is a commonly used mathematical ob¬ 
ject that turns one functioiu into another by adding up the products of some other 
specified functions with its derivatives. Symbolically, an ODO is a polynomial in 
“9” having coefficients that are functions of x. The highest power of d appear¬ 
ing with a non-zero coefficient is called the order of the operator. ODOs act on 
functions of x according to the rule: 

n n 

(1) L = '^a^{x)d" implies L{f) :^'^a^{x)f^"\x). 

i=0 2—0 

Much interest in differential operators comes from their use in writing linear 
differential equations. However, ODOs also have the algebraic structure of a non- 
commutative ring. They can be added as one would add any polynomials, by 
combining the coefficients of similar powers of 9, and multiplication is defined by 
extending the following rule for the product of two monomials linearly over sums: 

m / \ 

(2) (a(x)9’") o {^(x)d^) = X P(x)9™+”-b 

i=o ^ ^ 

Although this definition may look complicated at first, in fact it is simply a con¬ 
sequence of the usual “product rule” from calculus, applied here to guarantee that 
multiplication corresponds to operator composition as one would expect. That is, 
this definition was chosen so that L o Q{j') = L{Q{f)). 

Given the rule @ for multiplication, the inverse question of factorization nat¬ 
urally arises. One factorization method for ODOs is surprisingly reminiscent of a 
familiar fact about polynomials. If a; = A is a root of the polynomial p{x)^ then you 
know that it has a factor of a; — A, the simplest first degree polynomial with this 
property. Similarly, for any non-zero function /(a;), 9 — f'/f is the simplest first 
order differential operator having / in its kernel, and if L is any ODO with / in 
its kernel then L = Q o [d — f'/f) for some differential operator Q. Moreover, just 
as knowing additional roots of the polynomial p would allow further factorization, 
one may factor a differential operator of order n into the product of operators n — k 

^Here and throughout this note, “functions of x” will be understood to mean ^^sufficiently 
differentiable functions of x” even if differentiability is not specifically mentioned. 
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and k from the knowledge of k linearly independent functions in its kernel. The 
general statement written in terms of Wronskian determinant^! is as follows: 

Theorem 1 (Scalar Case). Let (j)i, ■ ■ ■ ■, (j^m be funetions sueh that Wr(())i,..., (pm) ^ 
0. Then (a) the unique monic differential operator K of order m satisfying K{cj)i) = 
0 for 1 < i < m is the one whose aetion on an arbitrary funetion f{x) is given by the 
formula K{f) = Wr(())i,..., (pm, /)/Wr(())i,..., pm) and (b) if L is any differential 
operator satisfying L{pi) = 0 for 1 < i < m then there exists a differential operator 
Q sueh that L = Q o K. 


2. The Matrix Case 

The purpose of this note is to generalize this well-known and useful result to 
the case of matrix coefficientt[!. A matrix coefficient ordinary differential operator 
(MODO) is again a polynomial in d with coefficients that depend on x, but we now 
consider the case in which those coefficients are N x N matrices. The formulas 
for multiplying MODOs and for applying them to functions remain the same (see 
m and but now the products involving the coefficients are understood to be 
matrix products and the function / is an A^-vector valued function. 

Matrix analogues of Theorem[T](a) already appear in the literature. For example, 
a result of Etingof-Gelfand-Retakh [T] allows one to produce a monic MODO with a 
specified kernel using quasi-determinants. However, not only is there no published 
analogue of Theorem [T](b) for MODOs, it appears that many researchers have 
suspected that it does not generalize nicely to the matrix case. A common proof of 
Theorem [T](b) depends on the fact that any non-zero ODO of order at most n has a 
kernel of dimension at most n. So, the fact that any MODO with a singular leading 
coefficient has an infinite-dimensional kernel is both an obstacle to generalizing the 
proof and reason to doubt the validity of the equivalent statement for MODOs. 

It is therefore good news that Theorem [^Kb) below does indeed fully generalize 
the scalar result to the matrix case without imposing any additional restrictions on 
the leading coefficient, order or kernel of the operator L. In addition to proving 
this new result, what follows can be seen as providing a novel alternative proof to 
Theorem [1] and Theorem [21(a). 

Definition. Let pi{x ),..., Pmn{x) be A-vector valued functions and let $ be the 
MN X MN block Wronskian matrix 


/ ^1 

p2 

pMN \ 

4‘mn 




to r—^ 

1 

XMN / 


Note that the first A rows of are given by the selected vector functions and 
then every other element is the derivative of the element A rows above it. In the 
case A = 1, the non-singularity of this matrix is famously equivalent to the linear 
independence of the functions, but that is not true in general. For instance, if 

^The Wronskian determinant Wr(</>i,..., <f>n) of the functions <f>i{x) is defined to be the deter¬ 
minant of the n X n matrix having (pjix) in row i and column j. 

^Another sort of higher-dimensional generalization is the case of partial differential operators, 

which is considered in [2]. 
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N = 2, M = \, (f>i = (1 I)"*" and (j )2 = {x x)^, then det(<i>) = 0 even though these 
are linearly independent functions of x. 


For the sake of brevity, the following notations will be utilized below. The 
symbol 4> will denote the N x MN matrix ((/)i ■ • ■ (^mn) and D{4>) = 0 will be 
used as a shorthand for the statement that D{(j)i) is equal to the zero vector for 
each 1 < i < MN (i.e. that the vector functions are in the kernel of some linear 
operator D). The N x N identity matrix will be written simply as I and more 
generally the m x m identity matrix for any natural number m will be denoted by 
■ 

The main result can now be stated concisely as: 


Theorem 2 (Matrix Case). If the functions (fi axe chosen so that det $ 0 then 

(a) the differential operator 


(4) 


K = Id’^^ 



( 


I 

dl 


\ 




is the unique monic MODO K of order M such that K{(f>) — 0 and (b) if L is any 
MODO such that L{(fl) — 0 then there exists a MODO Q such that L = Q o K. 


Proof. For a natural number m > M, let denote the {mN + N) x {MN + N) 
matrix with block decomposition 



(^2 

4>mn 

I \ 

4 

4 • 

^MN 

dl 


44'’ • 

• 441 


It is a nearly trivial observation that for a MODO L of order at most m one has 

m 

(5) L = {ao ai am) L{(j)MN) L). 

That is, for any choice of L the product of the N x {mN + N) matrix ot made from 
its coefficients with has the vector L{(l)i) as its column for 1 < * < MN and 
the last N X N block is a copy of the operator itself, and conversely for any choice 
of X {mN + N) matrix a its product with yields a matrix that records a 
differential operator L and its action on each of the vector functions as in ([S]). 

In the case that m = M and the N x N blocks ai are defined by 

a = (ao • • • am) = “ 4>^mn /) ’ 

the product in © would equal 

(- 4 ^^ ... - 4421 ) = (- 4 ^^ l) qmj) = (0 K) 

for some MODO K. Because the MODOs in the block B are of degree at most 
M — 1, K is monic of degree M. We know that K{cf)) = 0 since the first MN 
columns, which are all zero, record its action on the functions (fi. This operator 
K can equivalently be produced with the formula @, as the quasi-determinant 
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|^m|m+i.m+i where <i>M is viewed as an (M + 1) x {M + 1) matrix with entries 
that are N x N matrices or as the Schur complement of the invertible block $ in 
the matrix This demonstrates the existence of the operator K promised in 

(a). (This could also have been achieved in many other ways, including merely by 
direct computation of K{(j)i), but doing it this way sets us up nicely for proving the 
rest of the claim.) 

Let G be the {mN + N) x {mN + N) matrix whose decomposition into N x 
{mN + TV) blocks 



fGo\ 


/ Go \ 


Gi 


Gi 

G = 


is such that 



U-/ 


\Gm-i ) 


and for i > M the block row Gi = (gio gn ■ ■ ■ gim) where the functions gij are 
defined by the equation o K = gij{x)dG Consider the product G^m- 

Its first MN rows would have the form {Imn B) since $“^<1) = Imn- For i > M, 
the product of block row Gi with is designed so that the MODO 5*“^ o K 
shows up as its last TV x TV block. According to ([5]), the previous columns would 
be the image of 4> under the action of that operator, but 3*“^ o = 0 so 


( 6 ) 


G^m = 


( Imn 
0 
0 

Vo d' 


B 

K 

d o K 

n — M 


K) 


Now, suppose L is any MODO of order at most m satisfying T(</>) = 0. Label 
its coefficients as in ([5]) and consider the product of the TV x (mTV + TV) matris0 
q = (ao • • • am)G~^ and the matrix \I/ = G^m- On the one hand, because the 
G~^ and G cancel, = (ao • • • am)^m is equal to the expression on the right 

in ©. Specifically, given the assumption = 0 it has the form (0 • • • 0 L). 

On the other hand, defining the TV x TV functions qi by the block decomposition 
Q = {qo • • • Qm) and making use of the block form of = G^m in ® it is clear 
that qff is also equal to (qp qi ... qM-i L), because the block Imn picks out and 
preserves the precisely the first M coefficient blocks. 

Combining these two observations, we conclude that qi = 0 for 0 < T < Tlf — 1. 
Considering only the last block column in the product q’®', one obtains an expression 
for L as a sum involving the functions qi as coefficients multplied by the operators 
in the last block column of G^m, but given the fact that the first M of those 
coefficients are zero this simplifies to: 


m /m — M \ 

T = ^ q. {d^-^ oK)=i ° K. 

i—M \ z—0 / 

Then the operator in parentheses above is the operator Q satisfying the claim in 
(b). 


^ The matrix G is invertible because its top-left MN x MN block ($ is invertible and 


below that it is lower-triangular with I’s along the diagonal. 
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Finally, suppose K was also a monic MODOs of order M such that K{<i)) = 0. 
Then D = K — K is an operator of order strictly less than M with this same 
property. The only way that D can have order less than M and also satisfy D = 
Q o K for some MODO Q is it Q = D = 0, which demonstrates the uniqueness of 
K and completes the proof. □ 


3. Example 


Consider the case M = N = 2 and the vector functions 

= (; ) , <^3 = (°) , and </.4 = (j) • 
Then the block Wronskian matrix in ([3|) is the matrix 


/ 


0 


—x^ 

0 

X 

0 

3a:^ 

2a; 

0 

0 

3a:^ 

0 

1 

oy 


whose invertibility assures us that there is a unique monic MODO of order 2 having 
all four of these vectors in its kernel. Using (|4]) we can easily determine that this 
operator is 


K = Id^ + 





Another MODO that obviously also has each (pi in its kernel is 


L = 


d^. 


Especially since the leading coefficient of L is non-zero and singular, a situation 
that cannot arise in the case = 1, without the theorem above it would not be 
clear that there is an algebraic relationship between L and K. However, Theorem[2] 
assures us that any MODO having these vectors in its kernel must have K as a 
right factor, and so there must be a differential operator Q such that L = Q o K. 
(In fact, one can check that 



realizes this factorization.) 
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